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(57) Abstract 

An immunogenic detoxified protein 
comprising the amino acid sequence of subunit 
A of cholera toxin (CT-A) or subunit A of an 
Escherichia coli heat labile toxin (LT-A) or a 
fragment thereof wherein one or more amino 
acids at, or in positions corresponding to Val- 
53, Ser-63, Val-97, Tyr-104 or Pro-106 are re- 
placed with another amino acid or deleted. Ex- 
amples of specific replacements include Val- 
53-Asp, Val-53-Glu, Val-53-Tyr, Ser-63-Lys, 
Val-97-Lys, Val-97-Tyr, Tyr-104-Lys, Tyr- 
104-Asp, Tyr-104-Ser, Pro-106-Ser. The immu- 
nogenic detoxified protein is useful as vaccine 
for Vibrio cholerae or an enterotoxigenic strain 
of Escherichia coli and is produced by recom- 
binant DNA means by site-directed mutagene- 
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Immunogenic detoxified mutants of cholera toxin and of the 
toxin LT, their preparation and their use for the 
preparation of vaccines 

5 Field of the Invention 

The present invention relates to immunogenic detoxified 
proteins of cholera toxins (CT) , or of heat labile toxins 
(LT) produced by the enterotoxigenic strains of Escherichia 

10 coli (E.coli) having substitutions at one or more of amino 
acids Val-53, Ser-63, Val-97, Tyr-104 or Pro-106 and to 
their use in vaccines which are useful for the prevention or 
treatment of cholera or enterotoxigenic E.Coli infections. 
The proteins can be suitably produced using recombinant DNA 

15 techniques by site-directed mutagenesis of DNA encoding the 
wild type toxins. 

Background to the Invention 

20 Cholera is a contagious disease widely distributed in the 
world, in particular in the Third World, where, in certain 
areas, it is endemic. The serious disorders which develop in 
the intestinal system prove fatal in a high percentage of 
the recorded cases of the disease. 

25 

The etiological agent of cholera is the Gram-negative 
microorganism Vibrio cholerae (V . cholerae) . This colonises 
the intestinal tract of individuals who have come into 
contact with it through ingestion of contaminated food or 

30 water, and multiplies to very high concentrations. The 
principal symptom is severe diarrhoea as a result of which 
the patient can lose as much as 10-15 litres of liquids per 
day via the faeces. As a result of the severe dehydration 
and loss of electrolytes, the patient does not withstand the 

35 infection in 50-60% of cases, and dies. The diarrhoea caused 
by V. cholerae is due to the secretion of cholera toxin, CT, 
which acts by stimulating the activity of the adenylate 
cyclase enzyme so as to induce disturbances at cell level. 
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Although cholera can be effectively cured by controlled and 
intense rehydration, the distribution of a vaccine is 
desirable with a view to complete control and future 
eradication of the disease. 

5 

At the present time, there exists a vaccination against 
cholera, consisting of parenteral administration of killed 
bacteria. Although some countries insist on vaccination 
against the disease, there are serious doubts as to its real 
10 usefulness, given that the current cellular vaccine protects 
against the consequences of the infection in only 50% of the 
cases and that the protection is also extremely limited in 
duration, to less than 6 months. 

15 In Bangladesh, an experimental trial is in progress (1990- 
92) of an oral vaccine consisting of killed bacteria with 
the addition of subunit B of cholera toxin, which is known 
to be highly immunogenic. This product succeeds in inducing 
lasting protection, without special side effects (Holmgren 

20 J., Clemens J., Sack DA., Sanchez J. and Svennerholm AM; 
"Oral Immunization against cholera" Curr. Top. Microbiol. 
Immunol. (1988), 146, 197-204). 

Cholera toxin resembles the heat labile toxins of 
25 enterotoxigenic strains of Escherichia coli in amino acid 
sequence, structure and mode of action. 

The consequences of infection with an enterotoxigenic strain 
of E.coli are similar to, though less serious than, those of 
30 cholera, and consist of severe diarrhoea and intestinal 
disorders . 

The CT and LT toxins all comprise a single A subunit (or 
protomer A) responsible for the enzymic activity of the 
35 toxin (herein CT-A or LT-A) and five identical B subunits 
(or protomer B) which are involved in the binding of the 
toxin to the intestinal epithelial cells (herein CT-B or LT- 
B). 
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3 

The A subunit penetrates the cell membrane and causes 
activation of adenylate cyclase by NAD-dependent ADP- 
ribosylation of a GTP-binding protein which controls the 
5 activity of the enzyme. The clinical effect of this is to 
cause massive fluid loss into the intestine. 

Considerable research has been conducted on cholera toxin 
and the E. coli heat labile toxins. 

10 

The sequence of CT is known and has been described 
(Mekalanos J.J. et al Nature 306 , page 551 (1983)). 

The sequence of LT from enterotoxigenic strains of E.coli 
15 is, as mentioned, 80% homologous to CT and it too is known 
and described in the scientific literature. Spicer E.K. et 
al (Biol. Chem. 257 p. 5716-5721 (1982)) describe the amino 
acid sequence of the A sub unit of the heat labile toxin 
from an enterotoxigenic strain of E. coli found in pigs. 

20 

A bacterial chromosomal form of LT has been identified and 
sequenced by Pickett C.L. et al (J. Bacterid. 169 , 5180- 
5187, (1987) . 

25 The sequence of the A subunit of LT from a strain of E. coli 
known to affect humans has also been sequenced (Yamamoto et 
al, J. Biol. Chem., 259, 5037-5044, (1984)). 

In view of the potential clinical significance of a vaccine 
30 against cholera and enterotoxigenic bacteria there is a 
continuing and great interest in producing a detoxified 
toxin capable of immunising against cholera and 
enterotoxigenic bacteria. The techniques of genetic 
engineering allow specific mutations to be introduced into 
35 the genes encoding the toxins and the production of the 
mutated toxins using now conventional techniques of gene 
expression and protein purification. 
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Various groups have attempted to identify mutations of the 
genes, which involve loss of the toxicity characteristics of 
the encoded proteins. The studies are predominantly being 
carried out in respect of the gene for the toxin LT, from E. 
5 coli . | 

Harford, S. et al (Eur. J. Biochem. 183, page 311 (1989)) 
describe the production of a toxoid by in vitro mutagenesis 
of the LT-A gene from E.coli pathogenic for pigs. The 

10 resulting successful mutation contained a Ser-61-Phe 
substitution and a Gly-79-Lys substitution, the former being 
considered the more important. Harford et al suggest that, 
because of the similarities between the LT-A genes in E.coli 
pathogenic to humans and pigs and the CT-A gene, and because 

15 the toxins are thought to operate by a common mechanism, it 
may be possible to produce a cholera holotoxoid by 
introducing the Ser-61-Phe mutation into the CT-A gene. 

Tsuji, T. et al (J. Biol. Chem. 265, p. 22520 (1990)) 
20 describe the mutation of the LT-A gene from plasmid EWD299 
to produce a single substitution Glu-112-Lys which affects 
the toxicity of the mutant LT yet does not change the 
immunogenicity of the protein. 

25 Grant, C.C.R. et al (Abstract B289 of the 92nd General 
Meeting of the American Society for Microbiology, 2 6-3 0th 
May 1992) describe conservative substitutions of histidines 
at 44 and 70 and tryptophan at 127 in LT-A which result in 
significant reductions in enzymic activity. 

30 

Some work has been conducted on mutations to CT. 

Kaslow, H.R. et al (Abstract B291 of the 92nd General 
Meeting of the American Society for Microbiology, 26-3 0th 
35 May 1992) describe mutating Asp-9 and His-44 and truncating 
after amino acid 180 in CT-A which all essentially eliminate 
activity. Mutating Arg-9 is said to markedly attenuate 
activity. Mutating other amino acid sites had little effect 
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on toxicity. 

Burnette, W.N. et al (Inf. and Immun. 59(11) . 4266-4270, 
(1991)) describe site-specific mutagenesis of CT-A to 
5 produce an Arg-7-Lys mutation paralleling that of a known 
detoxifying mutation in the A subunit of the Bordetella 
pertussis toxin. The mutation resulted in the complete 
abolition of detectable ADP-ribosyltransf erase activity. 

10 International patent application WO 92/19265 (Burnette, 
Kaslow and Amgen Inc.) describes mutations of CT-A at Arg- 
7 , Asp-9, Arg-11, His-44, His-70 and Glu-112. 

Mutations at Glu-110 (LT and CT) and Arg-146 (LT) have also 
15 been described in the literature (Lobet, Inf. Immun., 2870, 
1991; Lai, Biochem. Biophys. Res. Comm. 341 1983; Okamoto J. 
Bacteriol. 2208, 1988). 

The crystal structure of LT has been determined by Sixma et 
20 al (Nature, 351, 371-377, May 1991) and confirms the 
mutatagenesis results described earlier in the literature, 
explaining structurally the significance of Glu-112 and Ser- 
61 in activity of the A sub unit and suggesting that His- 
44, Ser-114 and Arg-54 which are in the immediate 
25 neighbourhood may be important for catalysis or recognition. 

Summary of the invention 

It has now been discovered by further and more detailed 
30 analysis of the structure of the toxins that certain further 
amino acids in the sequences of CT-A and LT-A are in 
positions capable of decreasing the enzymatic activity of 
CT and LT when mutated suitably, individually or in 
conjunction with other mutations. 

35 

The object of the present invention is to provide a vaccine 
which gives total protection against cholera or 
enterotoxigenic B. coli, by means of a second generation 
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product consisting of a single antigen, a toxoid derived 
from CT or LT, which has been detoxified genetically. 

The genetic detoxification of CT or LT retains the 
5 immunogenic properties of the toxoid whilst providing a 
significantly reduced and preferably absent toxicity. 

According to a first aspect of the invention there is 
provided an immunogenic detoxified protein comprising the 

10 amino acid sequence of subunit A of a cholera toxin (CT-A) 
or a fragment thereof or subunit A of an Escherichia coli 
heat labile toxin (LT-A) or a fragment thereof, wherein one 
or more amino acids at, or in positions corresponding to 
Val-53, Ser-63, Val-97, Tyr-104 or Pro-106 are replaced with 

15 another amino acid. 

The replaced amino acids are at locations in the sequences 
of CT-A or an LT-A which are conserved both in the amino 
acid sequence and structurally and are thus common to CT 
20 and the various LTs. 

The immunogenic detoxified protein of the invention adopts 
substantially the same structural conformation as the wild 
type naturally occuring toxins. It is immunologically 
25 active and cross reacts with antibodies to the wild type 
toxins . 

In this specification, references to CT and LT encompass the 
various naturally occurring strain variants as- well other 
30 variants encompassing changes from the sequences disclosed 
herein which do not affect the immunogenicity of the 
assembled toxoid. 

In this specification, references to amino acid coordinates 
35 such as 11 Val-97" connote the amino acid at that position in 
the sequence of the mature cholera toxin subunit A (CT-A) , 
that is without the signal sequence (see Figure 1) . 
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Where the specification refers to an LT-A, the amino acid 
coordinates refer to the corresponding position in CT-A as 
shown in Figure 1. 

5 Thus, for example, Val-53 in CT corresponds to Val-52 in the 
LT1 subunit and Ser-63 in CT corresponds to Ser-62 in LT1, 
there being a single amino acid difference in numbering up 
to amino acid 89 of the LT1 sequence. Val-97 in the CT 
sequence corresponds to Val-93 in the LT1 sequence because 
10 of the four amino acid difference at that point in the 
sequence . 

In addition, the immunogenic detoxified protein of the 
invention may include other mutations such as, for example, 
15 substitutions at one or more of Arg-7, Asp-9, Arg-11, His- 
44, Arg-54, Ser-61, His-70, His-107, Glu-110, Glu-112, Ser- 
114, Trp-127, Arg-14 6 or Arg-192. 

The amino acid substituted for the wild type amino acid may 
20 be a naturally occurring amino acid or may a modified or 
synthetic amino acid. The substitution may involve deletion 
of an amino acid altogether provided that the mutant retains 
the necessary immunogenic properties and exhibits a 
substantially reduced toxicity. 

25 

Substitutions which alter the amphotericity and 
hydrophilicity whilst retaining the steric effect of the 
substituting amino acid as far as possible are generally 
preferred . 

30 

Preferred substitutions include: Val-53 -Asp, Val-53-Glu, 
Val-53-Tyr, Ser-63-Lys, Val-97-Lys, Val-97-Tyr, His-107-Glu, 
Tyr-104-Lys, Tyr-104-Asp, Tyr-104-Ser, Pro-106-Ser, Ser-114-. 
Glu, Ser-114-Lys. 

35 

As used herein, the term "detoxified" means that the 
immunogenic composition exhibits a substantially lower 
toxicity relative to its naturally occurring toxin 
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counterpart. The substantially lower toxicity should be 
sufficiently low for the protein to be used in an 
immunogenic composition in an immunologically effective 
amount as a vaccine with causing significant side effects. 
5 For example, the immunogenic detoxified protein should have 
a toxicity of less than 0.01% of the naturally occurring 
toxin counterpart. The toxicity may be measured in mouse 
CHO cells or preferably by evaluation of the morphological 
changes induced in Yl cells. The term "toxoid" means a 
10 genetically detoxified toxin. 

The immunogenic protein may be a CT or LT subunit A toxoid, 
but is preferably an assembled toxin molecule comprising a 
mutated CT-A or LT-A subunit and five B subunits of CT or 
15 LT. The B subunit may be a naturally occurring subunit or 
"may itself be mutated. 

The immunogenic protein is preferably a naturally occurring 
CT-A or an LT-A suitably modified as described above. 

20 However, conservative amino acid changes may be made which 
do not affect the immunogenic ity or the toxicity of 
immunogenic protein and preferably do not affect the ability 
of the immunogenic protein to form complete toxin with B 
subunit protein. Also, the immunogenic protein may be a 

25 fragment of CT-A or an LT-A provided that the fragment is 
immunogenic and non toxic and contains at least one of the 
conserved regions containing one of the mutations according 
to the invention. 

30 According to a second aspect of the invention, there is 
provided an immunogenic composition for use as a vaccine 
comprising an immunogenic detoxified protein of the first 
aspect of the invention and a pharmaceutical^ acceptable 
carrier. 

35 

The immunogenic composition may additionally contain one or 
more adjuvants and/or pharmaceutically acceptable diluents. 
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The invention also provides a vaccine composition comprising 
an immunogenic detoxified protein according to the first 
aspect of the invention and a pharmaceutical ly acceptable 
carrier. The vaccine composition may further comprise an 
5 adjuvant. 

According to a third aspect of the invention, there is 
provided a method of vaccinating a mammal against ViJbrio 
cholerae or an enterotoxigenic strain of Escherichia coli 
10 comprising administering an immunologically effective amount 
of an immunogenic detoxified protein according to the first 
aspect of the invention. 

The immunogenic detoxified proteins of the invention may be 
15 synthesised chemically using conventional peptide synthesis 
techniques, but are preferably produced by recombinant DNA 
means . 

According to a fourth aspect of the invention there is 
20 provided a DNA sequence encoding an immunogenic detoxified 
protein according to the first aspect of the invention. 

Preferably the DNA sequence contains a DNA sequence encoding 
a complete CT or LT comprising DNA encoding both the 
25 detoxified subunit A and subunit B in a polycistronic unit. 
Alternatively, the DNA may encode only the detoxified 
subunit A. 

According to a fifth aspect of the invention, there is 
30 provided a vector carrying a DNA according to the fourth 
aspect of the invention. 

According to a sixth aspect of the invention, there is 
provided a host cell line transformed with .„ the vector 
35 according to the fifth aspect of the invention. 

The host cell may be any host capable of producing CT or LT 
but is preferably a bacterium, most suitably E.coli or 
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V^cholerae suitable engineered to produce the desired 
immunogenic detoxified protein. 

In a further embodiment of the sixth aspect of the 
5 invention, the host cell may itself provide a protective 
species , for example an E.coli or V.cholerae strain mutated 
to a phenotype lacking wild type LT or CT and carrying and 
expressing an immunogenic detoxified protein of the first 
aspect of the invention. 

10 

In a further embodiment of the sixth aspect of the invention 
the host cell is capable of expressing a chromosomal LT-A 
gene according to the first aspect of the invention. 

15 According to a seventh aspect of the invention, there is . 
provided a process for the production of an immunogenic 
detoxified protein according to the first aspect of the 
invention comprising culturing a host cell according to the 
sixth aspect of the invention. 

20 

According to a eighth aspect of the invention there is 
provided a process for the production of DNA according to 
the fourth aspect of the invention comprising the steps of 
subjecting a DNA encoding a CT-A or an LT-A or a fragment 
25 thereof to site-directed mutagenesis. 

According to a ninth aspect of the invention there is 
provided a process for the formulation of a vaccine 
comprising bringing an immunogenic detoxified protein 
30 according to the first aspect of the invention into 
association with a pharmaceutically acceptable carrier and 
optionally with an adjuvant. 

Industrial Applicability 

35 

The immunogenic detoxified protein of the invention 
constitutes the active component of a vaccine composition 
useful for the prevention and treatment of cholera 
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infections or infections by enterotoxigenic strains of 
E.coli. The compositions are thus applicable for use in the 
pharmaceutical industry. 

5 Brief Description of the Drawings 

Figure 1 shows the amino acid sequences of the wild type 
subunit A from: 

10 i) cholera toxin (CT - Mekalanos et al op cit) , 

ii) heat labile toxin from an E.coli strain found in man 
(LT1_1 - Yamamoto et al op cit) 

iii) heat labile toxin from an E.coli strain found in pigs 
(LT1- Spicer et al op cit) , and 

15 iv) heat labile toxin from a chromosomal source (LT1_1 - 
Pickett et al op cit) 

The signal sequences are not shown* 

20 In Figure 1, the conventional single letter amino acid code 
is used. The symbol 11 . 11 denotes an absent amino acid and 
acts as a typographical spacer to ensure that the sequences 
remain in alignment for ease of comparison. The symbol ,! - 
11 indicates an amino acid in the sequences of LT1 and LT2 

25 which is identical to the corresponding amino acid in CT. 
The numbers against each line are the amino acid number of 
the first amino acid on that line. 

In Figure 1 the positions of the mutations of the present 
30 invention are shown underlined. 

Figures 2a and 2b are comparisons of the amino acid and DNA 
sequences of the A sub units of LT1 and CT. 

35 Figure 3 is a restriction map of plasmid EWD299 (Dallas et 
al) , bearing the LT-A gene. 

Detailed Description of Embodiments of the Invention 
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The practice of the present invention will employ, unless 
otherwise indicated, conventional techniques of molecular 
biology, microbiology, recombinant DNA, and immunology, 
5 which are within the skill of the art. Such techniques are 
explained fully in the literature. See e.g., Sambrook, et 
al. , MOLECULAR CLONING; A LABORATORY MANUAL, SECOND EDITION 
(1989) ; DNA CLONING, VOLUMES I AND II (D.N Glover ed. 

1985) ; OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); 
10 NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds. 

1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. 
Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 

1986) ; IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B. 
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984) ; the 

15 series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); GENE 
TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Miller and M.P. 
Calos eds. 1987, Cold Spring Harbor Laboratory), Methods in 
Enzymology Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, 
eds., respectively), Mayer and Walker, eds. (1987), 

20 IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION; PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IM- 
MUNOLOGY, VOLUMES I-IV (D.M. Weir and C. C. Blackwell eds 

25 1986) . 

Standard abbreviations for nucleotides and amino acids are 
used in this specification. All publications , patents, and 
patent applications cited herein are incorporated by 
30 reference. 

In particular, the following amino acid abbreviations are 
used: 

* 

35 Alanine A Ala 

Arginine R Arg 

Asparagine N Asn 

Aspartic Acid D Asp 
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Cysteine 


c 


Cys 




Glycine 


G 


Gly 




Glutamic Acid 


E 


Glu 




Glutamine 


Q 


Gin 


5 


Histidine 


H 


His 




Isoleucine 


I 


He 




Leucine 


L 


Leu 




Lysine 


K 


Lys 




Methionine 


M 


Met 


10 


Phenylalanine 


F 


Phe 




Proline 


P 


Pro 




Serine 


S 


Ser 




Threonine 


T 


Thr 




Tryptophan 


W 


Trp 


15 


Tyrosine 


Y 


Tyr 




Valine 


V 


Val 



As mentioned above examples of the immunogenic detoxified 
protein that can be used in the present invention include 
20 polypeptides with minor amino acid variations from the 
natural amino acid sequence of the protein other than at the 
sites of mutation specifically mentioned. 

A significant advantage of producing the immunogenic 
25 detoxified protein by recombinant DNA techniques rather than 
by isolating and purifying a protein from natural sources is 
that equivalent quantities of the protein can be produced by 
using less starting material than would be required for 
isolating the protein from a natural source. Producing the 
30 protein by recombinant techniques also permits the protein 
to be isolated in the absence of some molecules normally 
present in cells. Indeed, protein compositions entirely 
free of any trace of human protein contaminants can readily 
be produced because the only human protein produced by the 
35 recombinant non-human host is the recombinant protein at 
issue. Potential viral agents from natural sources and 
viral components pathogenic to humans are also avoided. 
Also, genetically detoxified toxin are less likely to revert 
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to a toxic from than more traditional, chemically detoxified 
toxins . 

Pharmaceutical^ acceptable carriers include any carrier 
5 that does not itself induce the production of antibodies 
harmful to the individual receiving the composition. 
Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic 
acids, polyglycolic acids, polymeric amino acids, amino acid 
10 copolymers, lipid aggregates (such as oil droplets or 
liposomes) and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 
Additionally, these carriers may function as 
immunostimulating agents (adjuvants) . 

15 

Preferred adjuvants to enhance effectiveness of the compo- 
sition include, but are not limited to: aluminum salts 
(alum) such as aluminium hydroxide, aluminium phosphate, 
aluminium sulfate etc., oil emulsion formulations, with or 

20 without other specific immunostimulating agents such as 
muramyl peptides or bacterial cell wall components, such as 
for example (1) MF59 (Published International patent 
application WO-A-90/14837, containing 5% Sgualene, 0.5% 
Tween® 80, 0.5% Span® 85 (optionally containing various 

25 amounts of MTP-PE (see below) , although not required) 
formulated into submicron particles using a microf luidizer 
such as Model 110 Y microf luidizer (Microf luidics, Newton, MA 
02164), (2) SAF, containing 10% squalene, 0.4% Tween 80, 5% 
pluronic-blocked polymer L121, and thr-MDP (see below) 

30 either microf luidized into a submicron emulsion or vortexed 
to generate a larger particle size emulsion, and (3) RIBI™ 
adjuvant system (RAS) (Ribi Immunochem, Hamilton, MT) 
containing 2% Sgualene, 0.2% Tween® 80 and one or more 
bacterial cell wall components from the group consisting of 

35 monophosphoryl lipid A (MPL) , trehalose dimycolate (TDM) , 
and cell wall skeleton (CWS) preferably MPL+CWS (Detox™) , 
muramyl peptides such as N-acetyl-muramyl-L-threonyl-D- 
isoglutamine (thr-MDP) , N-acetyl-normuramyl-L-alanyl-D-iso- 
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glutamine (nor-MDP) , N-acetylmuramyl-L-alanyl-D- 
isoglutaminyl-L-alanine-2- (1 1 -2 1 -dipalmitoyl-sn-glycero-3- 
hydroxyphosphoryloxy) -ethylamine (MTP-PE) etc., and 
cytokines, such as interleukins (IL-1, IL-2 etc) macrophage 
5 colony stimulating factor (M-CSF) , tumour necrosis factor 
(TNF) etc. Additionally, saponin adjuvants, such as 
Stimulon™ (Cambridge Bioscience, Worcester, MA) may be used 
or particles generated therefrom such as ISCOMS 
(immunostimulating complexes) . Furthermore, Complete 
10 Freunds Adjuvant (CFA) and Incomplete Freunds Adjuvant (IFA) 
may be used. Alum and MF59 are preferred. 

The immunogenic compositions (e.g. the antigen, 
pharmaceutically acceptable carrier and adjuvant) typically 
15 will contain diluents, such as water, saline, glycerol, 
ethanol, etc. Additionally, auxiliary substances, such as 
wetting or emulsifying agents, pH buffering substances, and 
the like, may be present in such vehicles. 

20 Typically, the immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; 
solid forms suitable for solution in, or suspension in, 
liquid vehicles prior to injection may also be prepared. 
The preparation also may be emulsified or encapsulated in 

25 liposomes for enhanced adjuvant effect as discussed above 
under pharmaceutically acceptable carriers. 

Immunogenic compositions used as vaccines comprise an 
immunologically effective amount of the antigenic 

30 polypeptides, as well as any other of the above-mentioned 
components, as needed. By "immunologically effective 
amount 11 , it is meant that the administration of that amount 
to an individual, either in a single dose or as part of a 
series, is effective for treatment or prevention. This 

35 amount varies depending upon the health and physical 
condition of the individual to be treated, the taxonomic 
group of individual to be treated (e.g., nonhuman primate, 
primate,, etc.), the capacity of the individual's immune 
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system to synthesize antibodies, the degree of protection 
desired , the formulation of the vaccine , the treating 
doctor's assessment of the medical situation, and other rel- 
evant factors. It is expected that the amount will fall in 
5 a relatively broad range that can be determined through 
routine trials • 

The immunogenic compositions are conventionally administered 
parenterally, e.g. by injection either subcutaneous ly or 

10 intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary 
formulations , suppositories and transdermal applications. 
Dosage treatment may be a single dose schedule or a multiple 
dose schedule. The vaccine may be administered in conjunc- 

15 tion with other immunoregulatory agents. 

The term "recombinant polynucleotide 11 as used herein intends 
a polynucleotide of genomic, cDNA, semisynthetic, or 
synthetic origin which, by virtue of its origin or 
20 manipulation: (1) is not associated with all or a portion of 
a polynucleotide with which it is associated in nature, (2) 
is linked to a polynucleotide other than that to which it is 
linked in nature, or (3) does not occur in nature. 

25 The term "polynucleotide" as used herein refers to a 
polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyr ibonucleotides . This term refers 
only to the primary structure of the molecule. Thus, this 
term includes double- and single-stranded DNA and RNA. It 

30 also includes known types of modifications, for example, 
labels which are known in the art, methylation, "caps", 
substitution of one or more of the naturally occurring 
nucleotides with an analog, internucleotide modifications 
such as, for example, those with uncharged linkages (e.g., 

3 5 methyl phosphonates , phosphotr ies ter s , phosphoamidates , 
carbamates , etc . ) and with charged 1 inkages (e.g., 
phosphorothioates, phosphor odithioates , etc.), those 
containing pendant moieties, such as, for example proteins 



SUBSTITUTE SHEET 



WO 93/13202 



17 



PCIYEP92/03016 



(including for e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), those with intercalators 
(e.g., acridine, psoralen, etc.), those containing chelators 
(e.g., metals, radioactive metals, boron, oxidative metals, 
5 etc.), those containing alkylators, those with modified 
linkages (e.g., alpha anomeric nucleic acids, etc.) / as well 
as unmodified forms of the polynucleotide. 

A "replicon" is any genetic element, e.g., a plasmid, a 
10 chromosome, a virus, a cosmid, etc. that behaves as an 
autonomous unit of polynucleotide replication within a cell; 
i.e., capable of replication under its own control. This 
may include selectable markers. 

15 A "vector" is a replicon in which another polynucleotide 
segment is attached, so as to bring about the replication 
and/or expression of the attached segment. 

"Control sequence" refers to polynucleotide sequences which 
20 are necessary to effect the expression of coding sequences 
to which they are ligated. The nature of such control 
sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include 
promoter, ribosomal binding site, and transcription 
25 termination sequence; in eukaryotes, generally, such control 
sequences include promoters and transcription termination 
sequence. The term "control sequences" is intended to 
include, at a minimum, all components whose presence is 
necessary for expression, and may also include additional 
30 components whose presence is advantageous, for example, 
leader sequences and fusion partner sequences. 

"Operably linked" refers to a juxtaposition wherein the 
components so described are in a relationship .permitting 
35 them to function in their intended manner. A control 
sequence "operably linked" to a coding sequence is ligated 
in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control 
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sequences. 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
5 region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide .sequence which is 
translated into a polypeptide, usually via mRNA, when placed 

10 under the control of appropriate regulatory sequences. The 
boundaries of the coding sequence are determined by a 
translation start codon at the 5 '-terminus and a translation 
stop codon at the 3' -terminus. A coding sequence can 
include , but is not limited to, cDNA, and recombinant 

15 polynucleotide sequences. 

"PGR" refers to the technique of polymerase chain reaction 
as described in Saiki, et al., Nature 324:163 (1986); and 
Scharf et al. , Science (1986) 233:1076-1078; and U.S. 
20 4,683,195; and U.S. 4,683,202. 

As used herein, x is "heterologous" with respect to y if x 
is not naturally associated with y in the identical manner; 
i.e., x is not associated with y in nature or x is not 
25 associated with y in the same manner as is found in nature. 

"Homology" refers to the degree of similarity between x and 
y. The correspondence between the sequence from one form to 
another can be determined by techniques known in the art. 

30 For example, they can be determined by a direct comparison 
of the sequence information of the polynucleotide. 
Alternatively, homology can be determined by hybridization 
of the polynucleotides under conditions which form stable 
duplexes between homologous regions (for example, those 

35 which would be used prior to S 1 digestion) , followed by 
digestion with single-stranded specific nuclease (s) , fol- 
lowed by size determination of the digested fragments. 
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As used herein, the term "polypeptide" refers to a polymer 
of amino acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides , and proteins are 
included within the definition of polypeptide. This term 
5 also does not refer to or exclude post expression 
modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the like. 
Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino acid 
10 (including, for example, unnatural amino acids, etc.), 
polypeptides with substituted linkages, as well as other 
modifications known in the art, both naturally occurring and 
non-naturally occurring. 

15 A polypeptide or amino acid sequence "derived from" a 
designated nucleic acid sequence refers to a polypeptide 
having an amino acid sequence identical to that of a 
polypeptide encoded in the sequence, or a portion thereof 
wherein the portion consists of at least 3-5 amino acids, 

20 and more preferably at least 8-10 amino acids, and even more 
preferably at least 11-15 amino acids, or which is im- 
munologically identifiable with a polypeptide encoded in the 
sequence. This terminology also includes a polypeptide 
expressed from a designated nucleic acid sequence. 

25 

The protein may be used for producing antibodies, either 
monoclonal or polyclonal, specific to the protein. The 
methods for producing these antibodies are known in the art. 

30 "Recombinant host cells", "host cells," "cells," "cell 
cultures," and other such terms denote, for example, 
microorganisms, insect cells, and mammalian cells, that can 
be, or have been, used as recipients for recombinant vector 
or other transfer DNA, and include the progeny of the 

35 original cell which has been transformed. It is understood 
that the progeny of a single parental cell may not 
necessarily be completely identical in morphology or in 
genomic or total DNA complement as the original parent, due 



WO 93/13202 



20 



PCT/EP92/03016 



to natural, accidental, or deliberate mutation. Examples 
for mammalian host cells include Chinese hamster ovary (CHO) 
and monkey kidney (COS) cells. 

5 Specifically, as used herein, "cell line," refers to a 
population of cells capable of continuous or prolonged 
growth and division in vitro . Often, cell lines are clonal 
populations derived from a single progenitor cell. It is 
further known in the art that spontaneous or induced changes 

10 can occur in karyotype during storage or transfer of such 
clonal populations. Therefore, cells derived from the cell 
line referred to may not be precisely identical to the 
ancestral cells or cultures, and the cell line referred to 
includes such variants. The term "cell lines" also includes 

15 immortalized cells. Preferably, cell lines include 
nonhybrid cell lines or hybridomas to only two cell types. 

As used herein, the term "microorganism" includes 
prokaryotic and eukaryotic microbial species such as 
20 bacteria and fungi, the latter including yeast and 
filamentous fungi. 

"Transformation", as used herein, refers to the insertion of 
an exogenous polynucleotide into a host cell, irrespective 
25 of the method used for the insertion, for example, direct 
uptake, transduction, f -mating or electroporation . The 
exogenous polynucleotide may be maintained as a 
non-integrated vector, for example, a plasmid, or 
alternatively, may be integrated into the host genome. 

30 

By "genomic" is meant a collection or library of DNA 
molecules which are derived from restriction fragments that 
have been cloned in vectors. This may include all or part 
of the genetic material of an organism. 

35 

By "cDNA" is meant a complementary DNA sequence that 
hybridizes to a complementary strand of DNA. 
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By "purified" and "isolated" is meant, when referring to a 
polypeptide or nucleotide sequence, that the indicated 
molecule is present in the substantial absence of other 
biological macromolecules of the same type. The term 
5 "purified" as used herein preferably means at least 75% by 
weight, more preferably at least 85% by weight, more 
preferably still at least 95% by weight, and most preferably 
at least 98% by weight, of biological macromolecules of the 
same type present (but water, buffers, and other small 
10 molecules, especially molecules having a molecular weight of 
less than 1000, can be present) . 

Once the appropriate coding sequence is isolated, it can be 
expressed in a variety of different expression systems; for 
15 example those used with mammalian cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

20 Mammalian expression systems are known in the art. A 
mammalian promoter is any DNA sequence capable of binding 
mammalian RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) 
into mRNA. A promoter will have a transcription initiating 

25 region, which is usually placed proximal to the 5 1 end of 
the coding sequence, and a TATA box, usually located 25-30 
base pairs (bp) upstream of the transcription initiation 
site. The TATA box is thought to direct RNA polymerase II 
to begin RNA synthesis at the correct site. A mammalian 

30 promoter will also contain an upstream promoter element, 
usually located within 100 to 200 bp upstream of the TATA 
box. An upstream promoter element determines the rate at 
which transcription is initiated and can act in either 
orientation [Sambrook et al. (1989) "Expression of Cloned 

35 Genes in Mammalian Cells." In Molecular Cloning: A 
Laboratory Manual, 2nd ed.K 

Mammalian viral genes are often highly expressed and have a 
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broad host range; therefore sequences encoding mammalian 
viral genes provide particularly useful promoter sequences. 
Examples include the SV40 early promoter, mouse mammary 
tumor virus LTR promoter, adenovirus major late promoter (Ad 
5 MLP) , and herpes simplex virus promoter. In addition, 
sequences derived from non-viral genes, such as the murine 
metallotheionein gene, also provide useful promoter 
sequences. Expression may be either constitutive or 
regulated (inducible), depending on the promoter can be 
10 induced with glucocorticoid in hormone-responsive cells. 

The presence of an enhancer element (enhancer) , combined 
with the promoter elements described above, will usually 
increase expression levels. An enhancer is a regulatory DNA 

15 sequence that can stimulate transcription up to 1000-fold 
when linked to homologous or heterologous promoters, with 
synthesis beginning at the normal RNA start site. Enhancers 
are also active when they are placed upstream or downstream 
from the transcription initiation site, in either normal or 

20 flipped orientation, or at a distance of more than 1000 
nucleotides from the promoter [Maniatis et al. (1987) 
Science 236: 1237; Alberts et al. (1989) Molecular Biology of 
the Cell . 2nd ed.]. Enhancer elements derived from viruses 
may be particularly useful, because they usually have a 

25 broader host range. Examples include the SV40 early gene 
enhancer [Dijkema et al (1985) EMBO J. 4:7611 and the 
enhancer /promoters derived from the long terminal repeat 
(LTR) of the Rous Sarcoma Virus [Gorman et al. (1982b) Proc. 
Natl. Acad. Sci. 79 :67771 and from human cytomegalovirus 

30 [Boshart et al. (1985) Cell 41 :5211. Additionally, some 
enhancers are regulatable and become active only in the 
presence of an inducer, such as a hormone or metal ion 

[Sassone-Corsi and Borelli (1986) Trends Genet. 2:215; 

Maniatis et al. (1987) Science 236:1237]. 

35 

A DNA molecule may be expressed intracellularly in mammalian 
cells. A promoter sequence may be directly linked with the 
DNA molecule, in which case the first amino acid at the N- 
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terminus of the recombinant protein will always be a 
methionine, which is encoded by the ATG start codon. If 
desired, the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

5 

Alternatively, foreign proteins can also be secreted from 
the cell into the growth media by creating chimeric DNA 
molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign 

10 protein in mammalian cells. Preferably, there are 
processing sites encoded between the leader fragment and the 
foreign gene that can be cleaved either in vivo or in vitro . 
The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct 

15 the secretion of the protein from the cell. The adenovirus 
triparite leader is an example of a leader sequence that 
provides for secretion of a foreign protein in mammalian 
cells. 

20 Usually, transcription termination and polyadenylation 
sequences recognized by mammalian cells are regulatory 
regions located 3* to the translation stop codon and thus, 
together with the promoter elements, flank the coding 
sequence. The 3 1 terminus of the mature mRNA is formed by 

25 site-specific post-transcriptional cleavage and polya- 
denylation [Birnstiel et al. (1985) Cell 41 :349: Proudfoot 
and Whitelaw (1988) "Termination and 3' end processing of 
eukaryotic RNA. In Transcription and splicing , (ed. B.D. 
Hames and D.M. Glover) ; Proudfoot (1989) Trends Biochem, 

30 Sci. 14 :105] . These sequences direct the transcription of 
an mRNA which can be translated into the polypeptide encoded 
by the DNA. Examples of transcription 

terminater/polyadenylation signals include those derived 
from SV4 0 [Sambrook et al (1989) "Expression of cloned genes 

35 in cultured mammalian cells." In Molecular Cloning : A 
Laboratory Manual] . 

Some genes may be expressed more efficiently when introns 
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(also called intervening sequences) are present. Several 
cDNAs, however, have been efficiently expressed from vectors 
that lack splicing signals (also called splice donor and 
acceptor sites) [see e.g., Gothing and Sambrook (1981) 
5 Nature 293 :6201. Introns are intervening noncoding 
sequences within a coding sequence that contain splice donor 
and acceptor sites. They are removed by a process called 
"splicing," following polyadenylation of the primary 
transcript [Nevins (1983) Annu. Rev. Biochem. 52 :441; Green 
10 (1986) Annu. Rev. Genet. 20 :671; Padgett et al. (1986) Annu. 
Rev. Biochem. 55 :1119; Krainer and Maniatis (1988) "RNA 
splicing." In Transcription and splicing (ed. B.D. Hames 
and D.M. Glover)]. 

15 Usually, the above described components, comprising a 
promoter, polyadenylation signal, and transcription 
termination sequence are put together into expression 
constructs. Enhancers, introns with functional splice donor 
and acceptor sites, and leader sequences may also be 

20 included in an expression construct, if desired. Expression 
constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable 
maintenance in a host, such as mammalian cells or bacteria. 
Mammalian replication systems include those derived from 

25 animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication 
systems of papovaviruses , such as SV40 [Gluzman (1981) Cell 
23:175] or polyomavirus , replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. 

30 Additional examples of mammalian replicons include those 
derived from bovine papillomavirus and Epstein-Barr virus. 
Additionally, the replicon may have two replication systems, 
thus allowing it to be maintained, for example, in mammalian 
cells for expression and in a procaryotic host for cloning 

35 and amplification. Examples of such mammalian-bacteria 
shuttle vectors include pMT2 [Kaufman et al. (1989) Mol. 
Cell. Biol. 9 :946 and pHEBO [Shimizu et al. (1986) MoJU 
Cell. Biol. 6 : 1074 ] . 
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The transformation procedure used depends upon the host to 
be transformed. Methods for introduction of heterologous 
polynucleotides into mammalian cells are known in the art 
5 and include dextran-mediated transf ection, calcium phosphate 
precipitation, polybrene mediated transf ection, protoplast 
fusion, electroporation, encapsulation of the 
polynucleotide (s) in liposomes, and direct microinjection of 
the DNA into nuclei. 

10 

Mammalian cell lines available as hosts for expression are 
known in the art and include many immortalized cell lines 
available from the American Type Culture Collection (ATCC) , 
including but not limited to, Chinese hamster ovary (CHO) 
15 cells, HeLa cells, baby hamster kidney (BHK) cells, monkey 
kidney cells (COS) , human hepatocellular carcinoma cells 
(e.g., Hep G2) , and a number of other cell lines. 

« 

ii. Baculovirus Systems 

20 

The polynucleotide encoding the protein can also be inserted 
into a suitable insect expression vector, and is operably 
linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. 

25 

Generally, the components of the expression system include 
a transfer vector, usually a bacterial plasmid, which 
contains both a fragment of the baculovirus genome, and a 
convenient restriction site for insertion of the 

30 heterologous gene or genes to be expressed; a wild type 
baculovirus with a sequence homologous to the baculovirus- 
specific fragment in the transfer vector (this allows for 
the homologous recombination of the heterologous gene in to 
the baculovirus genome) ; and appropriate insect ^host cells 

35 and growth media. 

. After inserting the DNA sequence encoding the protein into 
the transfer vector, the vector and the wild type viral 
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genome are transfected into an insect host cell where the 
vector and viral genome are allowed to recombine. The 
packaged recombinant virus is expressed and recombinant 
plaques are identified and purified. Materials and methods 
5 for baculo virus/ insect cell expression systems are 
commercially available in kit form from, inter alia, 
Invitrogen, San Diego CA ("MaxBac" kit). These techniques 
are generally known to those skilled in the art and fully 
described in Summers and Smith, Texas Agricultural 
10 Experiment Station Bulletin No. 1555 (1987) (hereinafter 
"Summers and Smith") . 

Prior to inserting the DNA sequence encoding the protein 
into the baculovirus genome , the above described components, 

15 comprising a promoter, leader (if desired), coding sequence 
of interest, and transcription termination sequence, are 
usually assembled into an intermediate transplacement 
construct (transfer vector) . This construct may contain a 
single gene and operably linked regulatory elements; 

20 multiple genes, each with its owned set of operably linked 
regulatory elements; or multiple genes, regulated by the 
same set of regulatory elements. Intermediate 
transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., 

25 plasmids) capable of stable maintenance in a host, such as 
a bacterium. The replicon will have a replication system, 
thus allowing it to be maintained in a suitable host for 
cloning and amplification. 

30 Currently, the most commonly used transfer vector for 
introducing foreign genes into AcNPV is pAc373. Many other 
vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters 
the polyhedrin start codon from ATG to ATT, ,and which 

35 introduces a BamHI cloning site 32 basepairs downstream from 
the ATT; see Luckow and Summers, Virology (1989) 17:31. 

The plasmid usually also contains the polyhedrin 
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polyadenylation signal (Miller et al. (1988) Ann. Rev. 
Microbiol. , 42:177) and a procaryotic ampicillin-resistance 
( amp ) gene and origin of replication for selection and 
propagation in JL. coli . 

5 

Baculovirus transfer vectors usually contain a baculovirus 
promoter. A baculovirus promoter is any DNA sequence 
capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5 1 to 3') transcription of a 

10 coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which 
is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription 

15 initiation site. A baculovirus transfer vector may also 
have a second domain called an enhancer, which, if present, 
is usually distal to the structural gene. Expression may be 
either regulated or constitutive. 

20 Structural genes, abundantly transcribed at late times in a 
viral infection cycle, provide particularly useful promoter 
sequences. Examples include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et al., 
(1986) "The Regulation of Baculovirus Gene Expression, " in: 
. 25 The Molecular Biology of Baculoviruses (ed. Walter 
Doerf ler) ; EPO Publ. Nos. 127 839 and 155 476; and the gene 
encoding the plO protein, Vlak et al., (1988),. J. Gen. 
Virol. 69:765. 

30 DNA encoding suitable signal sequences can be derived from 
genes for secreted insect or baculovirus proteins, such as 
the baculovirus polyhedrin gene (Carbonell et al, (1988) 
Gene , 73:409). Alternatively, since the signals for 
mammalian cell posttranslational modif ications „ (such as 

35 signal peptide cleavage, proteolytic cleavage, and 
phosphorylation) appear to be recognized by insect cells, 
and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the 
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invertebrate cells and vertebrate cells, leaders of non- 
insect origin , such as those derived from genes encoding 
human a-interferon, Maeda et al., (1985) , Nature 315:592; 
human gastr in-releasing peptide, Lebacq-Verheyden et al., 
5 (1988), Molec. Cell. Biol. 8:3129; human IL-2, Smith et al., 
(1985) Proc. Nat'l Acad. Sci. USA , 82:8404; mouse IL-3, 
(Miyajima et al. , (1987) Gene 58:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 2:99, can also 
be used to provide for secretion in insects. 

10 

A recombinant polypeptide or polyprotein may be expressed 
intracellularly or, if it is expressed with the proper 
regulatory sequences, it can be secreted. Good 
intracellular expression of nonfused foreign proteins 

15 usually requires heterologous genes that ideally have a 
short leader sequence containing suitable translation 
initiation signals preceding an ATG start signal. If 
desired, methionine at the N- terminus may be cleaved from 
the mature protein by in vitro incubation with cyanogen 

20 bromide. 

Alternatively, recombinant polyproteins or proteins which 
are not naturally secreted can be secreted from the insect 
cell by creating chimeric DNA molecules that encode a fusion 

.25 protein comprised of a leader sequence fragment that 
provides for secretion of the foreign protein in insects. 
The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct 
the translocation of the protein into the endoplasmic 

30 reticulum. 

After insertion of the DNA sequence and/or the gene encoding 
the expression product precursor of the protein, an insect 
cell host is co-transformed with the heterologous DNA of the 
35 transfer vector and the genomic DNA of wild type baculovirus 
usually by co-transfection. The promoter and 
transcription termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. 
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Methods for introducing heterologous DNA into the desired 
site in the baculovirus virus are known in the art. (See 
Summers and Smith supra ; Ju et al. (1987); Smith et al., 
Mol. Cell. Biol. (1983) 2:2156; and Luckow and Summers 
5 (1989)) . For example, the insertion can be into a gene such 
as the polyhedrin gene, by homologous double crossover 
recombination; insertion can also be into a restriction 
enzyme site engineered into the desired baculovirus gene. 
Miller et al. , (1989) , Bioessavs 4: 91. The DNA sequence, when 
10 cloned in place of the polyhedrin gene in the expression 
vector, is flanked both 5 1 and 3' by polyhedrin-specif ic 
sequences and is positioned downstream of the polyhedrin 
promoter . 

15 The newly formed baculovirus expression vector is 
subsequently packaged into an infectious recombinant 
baculovirus. Homologous recombination occurs at low 
frequency (between about 1% and about 5%); thus, the 
majority of the virus produced after cotransf ection is still 

20 wild-type virus. Therefore, a method is necessary to 
identify recombinant viruses. An advantage of the 
expression system is a visual screen allowing recombinant 
viruses to be distinguished. The polyhedrin protein, which 
is produced by the native virus, is produced at very high 

25 levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms 
occlusion bodies that also contain embedded particles. 
These occlusion bodies, up to 15 fim in size, are highly 
refract ile, giving them a bright shiny appearance that is 

30 readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To 
distinguish recombinant virus from wild-type virus, the 
transfection supernatant is plaqued onto a monolayer of 
insect cells by techniques known to those skilled in the 

35 art. Namely, the plaques are screened under the light 
microscope for the presence (indicative of wild-type virus) 
or absence (indicative of recombinant virus) of occlusion 
bodies. "Current Protocols in Microbiology" Vol. 2 (Ausubel 
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et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, 
supra : Miller et al. (1989). 

Recombinant baculovirus expression vectors have been 
5 developed for infection into several insect cells. For 
example, recombinant baculoviruses have been developed for, 
inter alia : Aedes aeovpti , Autoaraph a calif ornica. Borobyx 
mori . Drosophiln Tnelanoaaster . Spodoptera fruqiperda, and 
Trichoplusia ni fPCT Pub. No. WO 89/046699; Carbonell et 
10 al., (1985) J. Virol. 56:153; Wright (1986) Nature 321:718; 
Smith et al., (1983) Mol. Cell. Biol. 3:2156; and see 
generally, Fraser, et al. (1989) In Vitro Cell. Dev. Biol. 
25:225) . 

15 Cells and cell culture media are commercially available for 
both direct and fusion expression of heterologous 
polypeptides in a baculovirus /express ion system; cell 
culture technology is generally known to those skilled in 
the art. See, e.g. . Summers and Smith supra. 

20 

The modified insect cells may then be grown in an 
appropriate nutrient medium, which allows for stable 
maintenance of the plasmid(s) present in the modified insect 
host. Where the expression product gene is under inducible 

25 control, the host may be grown to high density, and 
expression induced. Alternatively, where expression is 
constitutive, the product will be continuously expressed 
into the medium and the nutrient medium must be continuously 
circulated, while removing the product of interest and 

30 augmenting depleted nutrients. The product may be purified 
by such techniques as chromatography, e.g. , HPLC, affinity 
chromatography, ion exchange chromatography, etc. ; 
electrophoresis; density gradient centrifugation; solvent 
extraction, or the like. As appropriate, the product may be 

35 further purified, as required, so as to remove substantially 
any insect proteins which are also secreted in the medium or 
result from lysis of insect cells, so as to provide a 
product which is at least substantially free of host debris, 
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e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host 
cells derived from the transf ormants are incubated under 
5 conditions which allow expression of the recombinant protein 
encoding sequence. These conditions will vary, dependent 
upon the host cell selected. However, the conditions are 
readily ascertainable to those of ordinary skill in the art, 
based upon what is known in the art. 

10 

iii. Bacterial Systems 

Bacterial expression techniques are known in the art. A 
bacterial promoter is any DNA sequence capable of binding 

15 bacterial RNA polymerase and initiating the downstream (3 11 ) 
transcription of a coding sequence (e.g. structural gene) 
into mRNA. A promoter will have a transcription initiation 
region which is usually placed proximal to the 5 1 end of the 
coding sequence. This transcription initiation region 

20 usually includes an RNA polymerase binding site and a 
transcription initiation site. A bacterial promoter may 
also have a second domain called an operator, that may 
overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated 

25 (inducible) transcription, as a gene repressor protein may 
bind the operator and thereby inhibit transcription of a 
specific gene. Constitutive expression may occur in the 
absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved 

30 by a gene activator protein binding sequence, which, if 
present is usually proximal (5 1 ) to the RNA polymerase 
binding sequence. An example of a gene activator protein is 
the catabolite activator protein (CAP) , which helps initiate 
transcription of the lac operon in Escherichia t coli (E. 

35 coli) [Raibaud et al. (1984) Annu. Rev. Genet. 18;173] . 
Regulated expression may therefore be either positive or 
negative, thereby either enhancing or reducing 
transcription. 
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Sequences encoding metabolic pathway enzymes provide 
particularly useful promoter sequences. Examples include 
promoter sequences derived from sugar metabolizing enzymes, 
5 such as galactose, lactose (lac) [Chang et al. (1977) Nature 
198 ; 10561 , and maltose. Additional examples include 
promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp_) [Goeddel et al. (1980) Nuc. Acids Res. 
8:4057; Yelverton et al. (1981) Nucl. Acids Res. 9; 731; U.S. 

10 Patent No. 4,738,921; EPO Publ. Nos. 036 776 and 121 775]. 
The g-laotamase (bla) promoter system [Weissmann (1981) "The 
cloning of interferon and other mistakes." In interferon 3 
(ed. I. Gresser) ] , bacteriophage lambda PL [Shimatake et al. 
(1981) Nature 292 ; 1281 and T5 [U.S. Patent No. 4,689,406] 

15 promoter systems also provide useful promoter sequences. 

In addition, synthetic promoters which do not occur in 
nature also function as bacterial promoters. For example, 
transcription activation sequences of one bacterial or 

20 bacteriophage promoter may be joined with the operon 
sequences of another bacterial or bacteriophage promoter, 
creating a synthetic hybrid promoter [U.S. Patent 
No. 4,551,433]. For example, the tac promoter is a hybrid 
tro-lac promoter comprised of both trp promoter and lac 
. 25 operon sequences that is regulated by the lac repressor 
[Amann et al. (1983) Gene 25:167; de Boer et al. (1983) 
Proc. Natl. Acad. Sci. 80:21]. Furthermore, a bacterial 
promoter can include naturally occurring promoters of non- 
bacterial origin that have the ability to bind bacterial RNA 

30 polymerase and initiate transcription. A naturally 
occurring promoter of non-bacterial origin can also be 
coupled with a compatible RNA polymerase to produce high 
levels of expression of some genes in prokaryotes. The 
bacteriophase T7 RNA polymerase/promoter system is an 

35 example of a coupled promoter system [Studier e£ al. (1986) 
.T. Mol. Biol. 189 :113: Tabor e£ aj,. (1985) Proc Natl. Acad. 
Sci. 82:1074]. In addition, a hybrid promoter can also be 
comprised of a bacteriophage promoter and an EL. coli 
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operator region (EPO Publ. No. 267 851). 

In addition to a functioning promoter sequence, an efficient 
ribosome binding site is also useful for the expression of 
5 foreign genes in prokaryotes. In E. coli , the ribosome 
binding site is called the Shine-Dalgarno (SD) sequence and 
includes an initiation codon (ATG) and a sequence 3-9 
nucleotides in length located 3-11 nucleotides upstream of 
the initiation codon [Shine et al. (1975) Nature 254:34]. 

10 The SD sequence is thought to promote binding of mRNA to the 
ribosome by the pairing of bases between the SD sequence and 
the 3» and of E. coli 16S rRNA [Steitz et al. (1979) 
"Genetic signals and nucleotide sequences in messenger RNA." 
In Biological Regulation and Development: Gene Expression 

15 (ed. R.F. Goldberger) ] . To express eukaryotic genes and 
prokaryotic genes with weak ribosome-binding site [Sambrook 
et al . (1989) "Expression of cloned genes in Escherichia 
coli." In Molecular Cloning: A Laboratory Manual !. 

20 A DNA molecule may be expressed intracellular ly. A promoter 
sequence may be directly linked with the DNA molecule, in 
which case the first amino acid at the N-terminus will 
always be a methionine, which is encoded by the ATG start 
codon. If desired, methionine at the N-terminus may be 

25 cleaved from the protein by in vitro incubation with 
cyanogen bromide or by either in vivo on in vitro incubation 
with a bacterial methionine N-terminal peptidase (EPO Publ. 
No. 219 237) . 

30 Fusion proteins provide an alternative to direct expression. 
Usually, a DNA sequence encoding the N-terminal portion of 
an endogenous bacterial protein, or other stable protein, is 
fused to the 5 1 end of heterologous coding sequences. Upon 
expression, this construct will provide a fusion of the two 

35 amino acid sequences. For example, the bacteriophage lambda 
cell gene can be linked at the 5 1 terminus of a foreign gene 
and expressed in bacteria. The resulting fusion protein 
preferably retains a site for a processing enzyme (factor 
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Xa) to cleave the bacteriophage protein from the foreign 
gene [Nagai et al. (1984) Nature 309 :8101 . Fusion proteins 
can also be made with sequences from the lac Z [Jia et al. 
(1987) Gene 60:197] , trpE [Allen et al. (1987) J. 
5 Biotechnol. 5:93; Makoff et al. (1989) J. Gen. M icrobiol. 
135:11], and Chev [EPO Publ. No. 324 647] genes. The DNA 
sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made 
10 with the ubiquitin region that preferably retains a site for 
a processing enzyme (e.g. ubiquitin specific processing- 
protease) to cleave the ubiquitin from the foreign protein. 
Through this method, native foreign protein can be isolated 
[Miller et al. (1989) Bio /Technology 7:698]. 

15 

Alternatively, foreign proteins can also be secreted from 
the cell by creating chimeric DNA molecules that encode a 
fusion protein comprised of a signal peptide sequence 
fragment that provides for secretion of the foreign protein 

20 in bacteria [U.S. Patent No. 4,336,336].. The signal 
sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the secretion of the 
protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the 

25 periplasmic space, located between the inner and outer 
membrane of the cell (gram-negative bacteria) . Preferably 
there are processing sites, which can be cleaved either in 
vivo or in vitro encoded between the signal peptide fragment 
and the foreign gene. 

30 

DNA encoding suitable signal sequences can be derived from 
genes for secreted bacterial proteins, such as the E. coli 
outer membrane protein gene (ompA) [Masui et al. (1983) , in: 
Experimental Manipulation of Gene Expression ; Ghrayeb et al. 
35 (1984) EMBO J. 2*2437] and the E. coli alkaline phosphatase 
signal sequence f phoA ) [Oka et al. (1985) Proc. Natl. Acad. 
Sci. 82:7212]. As an additional example, the signal 
sequence of the alpha-amylase gene from various Bacillus 
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strains can be used to secrete heterologous proteins from 
B. subtil is [Palva et al. (1982) Proc. Natl, Acad. Sci. USA 
2£:5582; EPO Publ. No. 244 042]. 

5 Usually, transcription termination sequences recognized by 
bacteria are regulatory regions located 3 1 to the 
translation stop codon, and thus together with the promoter 
flank the coding sequence. These sequences direct the 
transcription of an mRNA which can be translated into the 

10 polypeptide encoded by the DNA. Transcription termination 
sequences frequently include DNA sequences of about 50 
nucleotides capable of forming stem loop structures that aid 
in terminating transcription. Examples include 

transcription termination sequences derived from genes with 

15 strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, .comprising a 
promoter, signal sequence (if desired) , coding sequence of 

20 interest, and transcription termination sequence, are put 
together into expression constructs. Expression constructs 
are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable 
maintenance in a host, such as bacteria. The replicon will 

25 have a replication system, thus allowing it to be maintained 
in a procaryotic host either for expression or for cloning 
and amplification. In addition, a replicon may be either a 
high or low copy number plasmid. A high copy number plasmid 
will generally have a copy number ranging from about 5 to 

30 about 200, and usually about 10 to about 150. A host 
containing a high copy number plasmid will preferably 
contain at least about 10, and more preferably at least 
about 20 plasmids. Either a high or low copy number vector 
may be selected, depending upon the effect of the vector and 

35 the foreign protein on the host. 

Alternatively, the expression constructs can be integrated 
into the bacterial genome with an integrating vector. 
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Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the 
vector to integrate. Integrations appear to result from 
recombinations between homologous DNA in the vector and the 
5 bacterial chromosome. For example, integrating vectors 
constructed with DNA from various Bacillus strains integrate 
into the Bacillus chromosome (EPO Publ. No. 127 328). 
Integrating vectors may also be comprised of bacteriophage 
or transposon sequences. 

10 

Usually, extrachromosomal and integrating expression 
constructs may contain selectable markers to allow for the 
selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host 

15 and may include genes which render bacteria resistant to 
drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin (neomycin), and tetracycline [Davies et al. (1978) 
Annu. Rev . Microbiol. 32:469]. Selectable markers may also 
include biosynthetic genes, such as those in the histidine, 

20 tryptophan, and leucine biosynthetic pathways - 

Alternatively, some of the above described components can be 
put together in transformation vectors • Transformation 
vectors are usually comprised of a selectable market that is 
25 either maintained in a replicon or developed into an 
integrating vector, as described above. 

Expression and transformation vectors, either extra- 
chromosomal replicons or integrating vectors, have been 

30 developed for transformation into many bacteria. For 
example, expression vectors have been developed for, inter 
alia , the following bacteria; Bacillus subtilis [Palva et 
al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. 
Nos- 036 259 and 063 953; PCT Publ. No, WO 84/04541], 

35 Escherichia coli [Shimatake et al. (1981) Nature 292:128; 
Amann et al. (1985) Gene 40:183; Studier et al. (1986) 
Mol. Biol. 189 :113: EPO Publ. Nos. 036 776, 136 829 and 136 
907] , Streptococcus cremoris [Powell et al. (1988) Appl. 
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Environ. Microbiol. 54:655]: Streptococcus lividans [Powell 
et al. (1988) Appl. Environ. Microbiol. 54 : 655] , 
Streptomyces lividans [U.S. Patent No. 4,745,056]. 

5 Methods of introducing exogenous DNA into bacterial hosts 
are well-known in the art, and usually include either the 
transformation of bacteria treated with CaCl 2 or other 
agents, such as divalent cations and DMSO. DNA can also be 
introduced into bacterial cells by electroporation. 

10 Transformation procedures usually vary with the bacterial 
species to be transformed. See e.g., [Masson et al. (1989) 
FEMS Microbiol. Lett. 6J):273; Palva et al. (1982) Proc. 
Natl. Acad. Sci. USA 79 :5582; EPO Publ. Nos. 036 259 and 063 
953; PCT Publ. No. WO 84/04541, Bacillus], [Miller et al. 

15 (1988) Proc. Natl. Acad. Sci. 85 :856; Wang et al. (1990) J. 
Bacteriol. 172:949, Campylobacter], [Cohen et al. (1973) 
Proc. Natl. Acad. Sci. 69:2110; Dower et al. (1988) Nucleic 
Acids Res. 16:6127; Kushner (1978) "An improved method for 
transformation of Escherichia coli with ColEl-derived 

20 plasmids. In Genetic Engineering: Proceedings of the 
International Symposium on Genetic Engineering ( eds . H.W. 
Boyer and S. Nicosia); Mandel et al. (1970) J. Mol. Biol. 
53:159; Taketo (1988) Biochim. Biophvs. Acta 949 :318; 
Escherichia], [Chassy et al- (1987) FEMS Microbiol. Lett. 

25 44:173 Lactobacillus]; [Fiedler et al. (1988) Anal. Biochem 
170 :38. Pseudomonas] ; [Augustin et al. (1990) FEMS 
Microbiol. Lett. 66:203, Staphylococcus], [Barany et al . 
(1980) J. Bacteriol. 144 :698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, 
30 in: Streptococcal Genetics (ed. J. Ferretti and R. Curtiss 
III); Perry et al. (1981) Infec. Immun. 32:1295; Powell et 
al. (1988) AppI. Environ. Microbiol. 54:655; Somkuti et al . 
(1987) Proc. 4th Evr. Cong. Biotechnology 1:412, 
Streptococcus] . 

35 

iv. Yeast Expression 

Yeast -expression systems are also known to one of ordinary 
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skill in the art. A yeast promoter is any DNA sequence 
capable of binding yeast RNA polymerase and initiating the 
downstream (3 1 ) transcription of a coding sequence (e.g. 
structural gene) into mRNA. A promoter will have a 
5 transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This 
transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription 
initiation site. A yeast promoter may also have a second 

10 domain called an upstream activator sequence (DAS) , which, 
if present, is usually distal to the structural gene. The 
UAS permits regulated (inducible) expression. Constitutive 
expression occurs in the absence of a DAS. Regulated 
expression may be either positive or negative, thereby 

15 either enhancing or reducing transcription. 

Yeast is a fermenting organism with an active metabolic 
pathway, therefore sequences encoding enzymes in the 
metabolic pathway provide particularly useful promoter 

20 sequences. Examples include alcohol dehydrogenase (ADH) 
(EPO Publ. No. 284 044), enolase, glucokinase, glucose-6- 
phosphate isomerase, glyceraldehyde-3-phosphate- 
dehydrogenase (GAP or GAPDH) , hexokinase, 
phosphofructokinase, 3 -phosphogly cerate mutase, and pyruvate 

25 kinase (PyK) (EPO Publ. No. 329 203) . The yeast PH05 gene, 
encoding acid phosphatase, also provides useful promoter 
sequences [Myanohara et al- (1983) Proc. Natl. Acad. Sci. 
PSA 80; 11 . 

30 In addition, synthetic promoters which do not occur in 
nature also function as yeast promoters. For example, DAS 
sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, 
creating a synthetic hybrid promoter. Examples of such 

35 hybrid promoters include the ADH regulatory sequence linked 
to the GAP transcription activation region (D.S. Patent Nos. 
4,876,197 and 4,880,734). Other examples of hybrid 
promoters include promoters which consist of the regulatory 
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sequences of either the ADH2, GAL4, GAL10 . OR PHQ5 genes, 
combined with the transcriptional activation region of a 
glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 
556). Furthermore, a yeast promoter can include naturally 
5 occurring promoters of non-yeast origin that have the 
ability to bind yeast RNA polymerase and initiate 
transcription. Examples of such promoters include, inter 
alia , [Cohen et al. (1980) Proc. Natl. Acad. Sci. USA 
77:1078; Henikoff et al. (1981) Nature Z83:835; Hollenberg 

10 et al. (1981) Curr. Topics Microbiol. Immunol. 96:119; 
Hollenberg et al. (1979) "The Expression of Bacterial 
Antibiotic Resistance Genes i the Yeast Saccharomyces 
cerevisiae, 11 in: Plasmids of Medical , Environmental and 
Commercial Importance (eds. K>N> Timmis and A. Puhler) ; 

15 Mercerau-Puigalon et al. (1980) Gene 11:163; Panthier et al. 
(1980) Curr. Genet. 2:109:1. 

A DNA molecule may be expressed intracellular ly in yeast. 
A promoter sequence may be directly linked with the DNA 
20 molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a 
methionine, which is encoded by the ATG start codon. If 
desired, methionine at the N-terminus may be cleaved from 
the protein by in vitro incubation with cyanogen bromide. 

25 

Fusion proteins provide an alternative for yeast expression 
systems, as well as in mammalian, baculovirus, and bacterial 
expression systems. Usually, a DNA sequence encoding the N- 
terminal portion of an endogenous yeast protein, or other 

30 stable protein, is fused to the 5 1 end of heterologous 
coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For 
example, the yeast or human superoxide dismutase (SOD) gene, 
can be linked at the 5' terminus of a foreign, gene and 

35 expressed in yeast. The DNA sequence at the junction of the 
two amino acid sequences may or may not encode a cleavable 
site. See e.g., EPO Publ. No. 196 056. Another example is 
a ubiquitin fusion protein. Such a fusion protein is made 
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with the ubiquitin region that preferably retains a site for 
a processing enzyme (e.g. ubiquitin-specific processing 
protease) to cleave the ubiquitin from the foreign protein. 
Through this method, therefore, native foreign protein can 
5 be isolated (see, e.g., PCT Publ. No. WO 88/024066). 

Alternatively, foreign proteins can also be secreted from 
the cell into the growth media by creating chimeric DNA 
molecules that encode a fusion protein comprised of a leader 

10 sequence fragment that provide for secretion in yeast of the 
foreign protein. Preferably, there are processing sites 
encoded between the leader fragment and the foreign gene 
that can be cleaved either in vivo or in vitro. The leader 
sequence fragment usually encodes a signal peptide comprised 

15 of hydrophobic amino acids which direct the secretion of the 
protein from the cell. 

DNA encoding suitable signal sequences can be derived from 
genes for secreted yeast proteins, such as the yeast 
20 invertase gene (EPO Publ. No. 012 873,* JPO Publ. No. 
62,096,086) and the A- factor gene (U.S. Patent No. 
4,588,684). Alternatively, leaders of non-yeast origin, 
such as an interferon leader, exist that also provide for 
secretion in yeast (EPO Publ. No. 060 057). 

25 

A preferred class of secretion leaders are those that employ 
a fragment of the yeast alpha-factor gene, which contains 
both a "pre" signal sequence, and a "pro" region. The types 
of alpha-factor fragments that can be employed include the 

30 full-length pre-pro alpha factor leader (about 83 amino acid 
residues) as well as truncated alpha-factor leaders (usually 
about 25 to about 50 amino acid residues) (U.S. Patent Nos. 
4,546,083 and 4,870,008; EPO Publ. No. 324 274). Additional 
leaders employing an alpha-factor leader fragment that 

35 provides for secretion include hybrid alpha-factor leaders 
made with a presequence of a first yeast, but a pro-region 
from a second yeast alphaf actor. (See e.g., PCT Publ. No. 
WO 89/02463.) 



WO 93/13202 



41 



PCT/EP92/03016 



Usually, transcription termination sequences recognized by 
yeast are regulatory regions located 3 1 to the translation 
stop codon, and thus together with the promoter flank the 
5 coding sequence. These sequences direct the transcription 
of an mRNA which can be translated into the polypeptide 
encoded by the DNA. Examples of transcription terminator 
sequence and other yeast-recognized termination sequences, 
such as those coding for glycolytic enzymes. 

10 

Usually, the above described components, comprising a 
promoter, leader (if desired) , coding sequence of interest, 
and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often 

15 maintained in a replicon, such as an extrachromosomal 
element (e.g., plasmids) capable of stable maintenance in a 
host, such as yeast or bacteria. The replicon may have two 
replication systems, thus allowing it to be maintained, for 
example, in yeast for expression and in a procaryotic host 

20 for cloning and amplification. Examples of such yeast- 
bacteria shuttle vectors include YEp24 [Botstein et al . 
(1979) Gene 8:17-24] , pCl/1 [Brake et al. (1984) Proc. Natl. 
Acad. Sci USA 81:4642-4646], and YRpl7 [Stinchcomb et al. 
(1982) J. Mol. Biol. 158:157]. In addition, a replicon may 

25 be either a high or low copy number plasmid. A high copy 
number plasmid will generally have a copy number ranging 
from about 5 to about 200, and usually about 10 to about 
150. A host containing a high copy number plasmid will 
preferably have at least about 10, and more preferably at 

30 least about 20. Enter a high or low copy number vector may 
be selected, depending upon the effect of the vector and the 
foreign protein on the host. See e.g., Brake et al. , supra . 

Alternatively, the expression constructs can be integrated 
35 into the yeast genome with an integrating vector. 
Integrating vectors usually contain at least one sequence 
homologous to a yeast chromosome that allows the vector to 
integrate, and preferably contain two homologous sequences 
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flanlcing the expression construct. Integrations appear to 
result from recombinations between homologous DNA in the 
vector and the yeast chromosome [Orr-Weaver et al. (1983) 
Methods in Enzvmol. 101:228-245] . An integrating vector may 
5 be directed to a specific locus in yeast by selecting the 
appropriate homologous sequence for inclusion in the vector. 
See Orr-Weaver et al. , supra . One or more expression 
construct may integrate, possibly affecting levels of 
recombinant protein produced [Rine et al. (1983) Proc. Natl, 

10 Acad- Sci. USA 80:6750] . The chromosomal sequences included 
in the vector can occur either as a single segment in the 
vector, which results in the integration of the entire 
vector , or two segments homologous to adjacent segments in 
the chromosome and flanking the expression construct in the 

15 vector, which can result in the stable integration of only 
the expression construct. 

Usually, extrachromosomal and integrating expression 
constructs may contain selectable markers to allow for the 

20 selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can 
be expressed in the yeast host, such as ADE2 . HIS4, LEU2, 
TEP1, and ALG7, and the G418 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G418, 

25 respectively. In addition, a suitable selectable marker may 
also provide yeast with the ability to grow in the presence 
of toxic compounds, such as metal. For example, the 
presence of CUP1 allows yeast to grow in the presence of 
copper ions [Butt et al. (1987) Microbiol, Rev. 51:351]. 

30 

Alternatively, some of the above described components can be 
put together into transformation vectors. Transformation 
vectors are usually comprised of a selectable marker that is 
either maintained in a replicon or developed into an 
35 integrating vector, as described above. 

Expression and transformation vectors, either 
extrachromosomal replicons or integrating vectors., have been 
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developed for transformation into many yeasts. For example, 
expression vectors have been developed for, inter alia , the 
following yeasts: Candida albicans [Kurtz, et al. (1986) Mol. 
Cell. Biol. 6: 142] . Candida maltose [Kunze, et al. (1985) J\ 
5 Basic Microbiol. 25 :141]. Hansenula polymorpha [Gleeson, et 
al . (1986) J. Gen. Microbiol. 132 :3459; Roggenkamp et al. 
(1986) Mol. Gen. Genet. 202:302], Kluyveromyces fragilis 
[Das, et al. (1984) J. Bacteriol. 158:1165], Kluyveromyces 
lactis [De Louvencourt et al. (1983) J. Bacteriol. 154 :737 : 

10 Van den Berg et al. (1990) Bio/Technology 8:135], Pichia 
guillerimondii [Kunze et al. (1985) J. Basic Microbiol. 
25:141], Pichia pastoris [Cregg, et al. (1985) Mol. Cell. 
Biol. 5:3376; U.S. Patent Nos. 4,837,148 and 4,929,555], 
Saccharomyces cerevisiae [Hinnen et aJL. (1978) Proc. Natl. 

15 Acad. Sci. USA 75:1929; Ito et al. (1983) J. Bacteriol. 
153 :163] , Schizosaccharomyces pombe [Beach and Nurse (1981) 
Nature 300:706], and Yarrowia lipolytica [Davidow, et al. 

(1985) Curr. Genet. 10:380471 Gaillardin, et al. (1985) 
Curr. Genet. 10:49] . 

20 

Methods of introducing exogenous DNA into yeast hosts are 
well-known in the art, and usually include either the 
transformation of spheroplasts or of intact yeast cells 
treated with alkali cations. Transformation procedures 
25 usually vary with the yeast species to be transformed. See 
e.g., [Kurtz et al. (1986) Mol. Cell. Biol. 6:142; Kunze et 
al. (1985) J. Basic Microbiol. 25:141; Candida]; [Gleeson 
et al. (1986) J. Gen. Microbiol. 132:3459; Roggenkamp et al. 

(1986) Mol. Gen. Genet. 202:302; Hansenula]; [Das et al . 
30 (1984) J. Bacteriol. 158:1165; De Louvencourt et al. (1983) 

J. Bacteriol. 154 :1165: Van den Berg et al. (1990) 
Bio/Technology 8:135; Kluyveromyces]; [Cregg et al. (1985) 
Mol. Cell. Biol. 5:3376; Kunze et al. (1985) J. Basic 
Microbiol. 25:141: U.S. Patent Nos. 4,837,148 and ,4 , 929 , 555 ; 
35 Pichia]; [Hinnen et al. (1978) Proc. Natl. Acad. Sci. USA 
75; 1929; Ito et al. (1983) J. Bacteriol. 153:163 
Saccharomyces]; [Beach and Nurse (1981) Nature 300 :706; 
Schizosaccharomyces]; [Davidow et al. (1985) Curr. Genet. 
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10:39; Gaillardin et al. (1985) Curr. Genet. 10:49; 

Yarrowia] . 

Example 1 - Detoxified LT 

5 

A fragment of the gene for LT was extracted from plasmid 
EWD299 [Dallas W.S., Gill D.M. and Falkow S., 1979, J. 
Bacterid., 139 . 850-858] by digestion with the restriction 
enzymes Smal and EcoRI, and was recloned in the vector 
10 Bluescript KS suitable for producing single strands of DNA 
[Sambrook J., Fritsch E. and Maniatis, T. "Molecular 
Cloning" , Cold Spring Harbor] . 

BW313 cells were transformed by the clones thus obtained and 
15 allowed to grow for 14 hours in a culture medium consisting 
of Luria Broth with the addition of 1 /xg/ml of uridine. 

A series of synthetic oligonucleotides (listed in Table 1 
below) , containing the mutation, or the desired bases 
20 instead of the natural ones, and a seguence of 10 bases 
upstream and 10 downstream of the same mutation, identical 
to the natural ones, was first of all synthesised chemically 
and then phosphorylated , 1.5 pmol thereof being treated at 
37 °C with 5 units of kinase. 

25 

After halting the reaction with a 100 mM EDTA solution, the 
oligonucleotides were annealed to the single strand 
containing the LT gene, by heating for 5 minutes at 70 °C and 
cooling slowly for about one hour in ice. 

30 

At that stage there was added to this cold solution (25 pi) 
a solution of free nucleotides, the enzyme DNA ligase and 
the enzyme DNA polymerase, in a final volume of 100 pi. 

0 

35 The solution thus obtained was kept for five minutes in ice, 
five minutes at ambient temperature and two hours at 37 °C. 

Suitable cells of E. coli were transformed with the reaction 
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mixture, in accordance with the usual techniques [Sambrook 
J., Fritsch E. and Maniatis T. "Molecular Cloning" Cold 
Spring Harbor] , and the site-directed mutagenesis was 
checked by sequencing of the clones obtained. 

5 

The Smal-EcoRI fragment containing the various mutations was 
substituted for the original Smal-EcoRI insert in the 
plasmid EWD299. 

10 The strains which encode the mutated toxin were then grown 
in 10 ml of Luria Broth for 12 hours at 37 °C 

The cultures were centrifuged and the precipitate containing 
the cells was resuspended in 300 ml of a solution containing 
15 25% of sucrose and 50 mM of Tris buffer at pH8, and the 
mixture was treated for one hour at ambient temperature with 
1 mg/ml of a solution of Polymixin B. 

The presence of the toxoid in the periplasmatic supernatant 
20 liquor was verified by means of Western Blot and its 
toxicity was evaluated by the inducement or lack of 
inducement of morphological changes in Yl cells (see Table 
1). 

25 Yl cells are adrenal tumour epithelial cells which become 
markedly more rounded when treated with a solution 
containing CT or LT [Yasamure Y., Buonassisi V. and Sato G., 
"Clonal analysis of differentiated function in animal cell - 
cultures", Cancer Res., 1966, 26, 529-535]. The toxicity of 

30 CT and LT is correlated with this morphological transition. 
The periplasmic supernatant is diluted with a solution of 
F10 medium, horse serum 1.5%, glutamine and gentamycin to 
lesser and lesser concentrations and Yl cells (250000 
cells/ml) are incubated with the resulting solutions for 48 

35 hours at 37<>C under an atmosphere of C0 2 . The morphology of 
the cells is evaluated. 

. In all cases, immunogenicity was shown by correct assembly 
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of the complete toxoid and by cross reaction of the toxoid 
with antibody to the wild type LT. 

The results are shown in Table I below. 

In this Table (and in Table II below) the toxicity symbols 
mean as follows: 



10 ++ 

+ 

15 



25 



30 



toxic after dilution 1:2000 (wild type toxicity) 
toxic up to dilution 1:250 

toxic up to dilution 1:64 

not toxic, even undiluted 

TABLE I 



Example Mutation 



20 1.1 LT 



1.2 
1.3 



LT 
LT 



1.4 LT 

1.5 LT 

1.6 LT 
1.8 LT 
1.9 



Val- 
Val- 
Val- 
Ser- 
Val- 
Val- 
His- 
Tyr- 



LT 

1.10 LT Tyr 

1.11 LT Tyr' 

1.12 LT Pro' 



•53 -Asp 

■53-Glu 

■53 -Tyr 

■63-Lys 

•97-Lys 

97 -Tyr 

107-Glu 

104-Lys 

104-Asp 

104-Ser 

106-Ser 



Oligonucleotide Sequence Toxicity 

2 9 1-ACCGGCTTTGATAGATATGAT- 3 11 

29 l-ACCGGCTTTGAAAGATATGAT-3 11 

29 l-ACCGGCTTTTACAGATATGAT-3 11 

322 -GTTTCCACTAAGCTTAGTTTG- 342 

424 -ATGTTTAATAAGAATGATGTA— 4 4 4 

424 — ATGTTTAATTACAATGATGTA-4 4 4 

454-TACAGCCCTGAGCCATATGAA-4 7 4 -f- 

445 -ATTAGCGTAAAGAGCCCT- 462 

44 5 -ATTAGCGTAGATAGCCCT-4 6 2 

44 7 -TAGCGTAAGTAGCCCTCA-4 64 

4 53 -ATACAGCAGCCACCCATA-47 0 



Two mutation of serine (Ser-114-Glu: 477-GGAGGTGAAGCGTTAGG- 
494 and Ser-l 14 — Lys : 4 7 7 -GGAGGTTAAAGCGTTAGG-4 9 4 ) were also 
shown to exhibit substantially reduced toxicity. 



35 



Comparative Examples 



A LT LT Wild Type «■ 

40 B LT Arg-210-Asp 769 -ATATATCTCAACGAATATCAA-7 8 9 + 

C LT Leu-41-Phe 113 -ATATTAATTTCTATGATC- 130 NA 

D LT His-44-Phe 1 2 1-CTTTATGATTTTGCGAGA- 138 NA 

E LT Ala-45-Tyr 12 5-ATGATCACTATAGAGGAA— 142 NA 

F LT Arg-54-Ala 152 -GCTTTGTCGCGTATGATG- 169 ++ 

45 G LT Arg-54-Lys 15 l-GGCTTTGTCAAGTATGATGAT-17 1 4+ 

H LT Tyr-59-Met 167 — ATGACGG AATGGTTTCCA- 18 4 # -Ht- 

I LT Val-60-Gly 169 -GACGGATATGGATCCACTTCT- 18 9 NA 

J LT Ser-68-Lys 193-AGTTTGAGAAAGGCTCACTTA-213 ++ 

K LT Ser-68-Pro 19 3 -AGTTTGAGACCAGCTCACTTA-2 13 NA 

50 L LT His-70-Pro 19 9 — AGAAGTGCTCCTTTAGCAGGA— 2 19 NA 

M LT Ala-72-Arg 20 5-GCTCACTTAAGGGGACAGTCT— 2 2 5 ++ 

N LT Ala-72-His 205 -GCTCACTTACATGGACAGTCT— 2 2 5 -H4- 

O LT Arg-192-Asn 565 -GATTCATC AATTACAATCACA— 5 8 5 4f4- 
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(NA means "not assembled", i.e the holotoxin AB 5 is not 
formed at all) 

5 Example 2 - Detoxified CT 

The procedure followed in the case of the gene for the toxin 
CT is analogous to that described above. 

10 A fragment containing the gene for a CT was amplified by 
means of the polymerase chain reaction (PCR) technique from 
plasmid.pCT322. An alternative and equivalent source of the 
CT gene is plasmid pJM17 (Pearson et al, PNAS USA, 79/ 
(1982), 2976-2980). 

15 

The following two synthetic primers were used: 
1 ) GGCAGATTCTAGACCTCCTGATGAAATAAA 
20 2 ) TGAAGTTTGGCGAAGCTTCTTAATTTGCCATACTAATTGCGGCAATCGCAT 

containing respectively an Xbal site and an artificial 
Hindlll site (shown underlined) . 

25 The resulting amplified fragment, Xbal-Hindlll, which has a 
length of 1074 base pairs, contains the codons of the two 
sub-units, A and B, but not the sequence encoding the leader 
peptide of the A sub-unit. This fragment was recloned in 
Bluescript KS vector and was treated in accordance with the 

30 procedure described above for LT, so as to effect the site- 
directed mutagenesis. 



WO 93/13202 



48 



PCT/EP92/03016 



10 



15 



20 



TABLE II 

Example Mutation Oligonucleotide Sequence Toxicity 

2.1 CTVal-53-Asp ACGGGATTTGACAGGCACGAT 

2.2 CTSer-63-Lys GTTTCCACCAAGATTAGTTTG 

2.3 CTVal-97-Lys ATGTTTAACAAGAATGATGTA 

2.4 CTSer-106-Pro GGCATACAGTAGCCATCCAGA 

Comparative Examples 

A CT Arg-192-Asn GAATGCTCCAAACTCATCGAT +++ 
B CTArg-54-His GGATTTGTTCATCACGATGAT ++ 

The following mutations also proved to abolish toxicity: 
107-Asn (TACAGTCCTAACCCAGATGAA) , Glu-110-Ser 
(TCATCCAGATTCGCAAGAAGT) , Glu-112-Ala (CAGATGAACAAGCTGTTTCTG) 
and Ser-114-Glu ( CAAG AAGTTGAAGCTTTAGGT ) . 

It will be understood that the invention is described above 
by way of example only and modifications of detail may be 
made within the scope and spirit of the invention. 
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CLAIMS : 

1. An immunogenic detoxified protein comprising the amino 
acid sequence of subunit A of a cholera toxin (CT-A) or a 

5 fragment thereof or the amino acid sequence of subunit A of 
an Escherichia coli heat labile toxin (LT-A) or a fragment 
thereof wherein one or more amino acids at, or in positions 
corresponding to Val-53, Ser-63, Val-97, Tyr-104 or.Pro-106 
are replaced with another amino acid. 

10 

2. An immunogenic detoxified protein according to claim 
1 wherein additionally one or more amino acids at, or in 
positions corresponding to Arg-7, Asp-9, Arg-11, His-44, 
Arg-54, Ser-61, His-70, His-107, Glu-110, Glu-112, Ser-114, 

15 Trp-127, Arg-146 or Arg-192 are replaced. 

3. An immunogenic detoxified protein according to claim 
1 or 2 comprising one or more of the following amino acid 
replacements Val-53 -Asp, Val-53-Glu, Val-53-Tyr, Ser-63-Lys, 

20 Val-97-Lys, Val-97-Tyr, His-107-Glu, Tyr-104-Lys, Tyr-104- 
Asp, Tyr-104-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

4. An immunogenic composition for use a vaccine 
comprising an immunogenic detoxified protein according to 

25 any one of the preceding claims and a pharmaceutically 
acceptable carrier. 

5. A vaccine composition comprising an immunogenic 
detoxified protein according to any one of claims 1 to 3 and 

30 a pharmaceutically acceptable carrier. 

6. A vaccine composition according to claim 5 further 
comprising an adjuvant. 

35 7. A DNA sequence encoding an immunogenic detoxified 
protein according to any one of claims l to 3. 

8. A vector carrying a DNA according to claim 7. 
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£0 

9 . A host cell line transformed with the vector according 
to claim 8. 

5 10. A process for the production of an immunogenic 
detoxified protein according to any one of claims 1 to 3 
comprising culturing a host cell according to claim 9. 

11. A process for the production of a DNA according to 
10 claim 7 comprising the steps of subjecting a DNA encoding 

a CT-A or an LT-A or a fragment thereof to site-directed 
mutagenesis. 

12. A method of vaccinating a mammal against Vibrio 
15 cholerae or an enterotoxigenic strain of Escherichia 

coli comprising administering an immunologically effective 
amount of an immunogenic detoxified protein according to any 
one of claims 1 to 3. 

20 13. A process for the formulation of a vaccine according 
to claim 5 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 3 into 
association with a pharmaceutically acceptable carrier. 

25 14. A process for the formulation of a vaccine according 
to claim 6 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 3 into 
association with an adjuvant. 
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LT2 1 -.-.FF T R-A L QQ-AYE PI 38 

LT1 1 FRS . 39 

LT1_1A 1 -G-R R HN 40 

CT 1 NDDKLYRADSRPPDEIKQSGGLMPRGQSEYFDRGTQMNIN 40 



— E V— NT— N TVT--Q 1 — N — GS- 78 

y 79 

LYDHARGTQTGFVRHDDGYVSTSISLRSAHLVGQTILSGH 80 



NE V-P L-D--G R Y-S-N-FA 118 

— LTIYI ... IS 116 

V ~Y— 120 

STYYIYVIATAPNMFNVNDVLGAYSPHPDEQEVSALGGIP 12 0 



L 1 SF-A-EGGMQ D — GDLF-G-TV — N— 158 

:— 156 

N 1 — R E R — N E- 160 

YSQI YGWYRVHFGVLDEQLHRNRGYRDRY YSNLD I APAAD 160 



— q SNFP M — STF — EQ-VPNNKEFK-GV-I 198 

196 

— R D-Q Q DSS-TITGD — N 200 

GYGLAGFPPEHRAWREEPWIHHAPPGCGNAPRSSMSNTCD 200 



SA-NV — KYD-MNFKKLL — RLALTFFM — D-F-GVHGE 241 

. . . 236 

-E--N-STIY-R D-.~EV-.IY .R 240 

EKTQSLGVKFLDEYQSKVKRQIFSGY . QSDID . THNRI . KDEL 240 



Figure 1 
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LT AATGGCGACAGATTATACCGTGCTGACTCTAGACCCCCAGATGAAATAAAACGTTTCCGG 

NGDRLYRADSRPPDE IKRFR 20 



NDDKLYRADSRPPDE IKQSG 20 
CT AATGATGATAAGTTATATCGGGCAGATTCTAGACCICCTGATGAAASAAAGCAGXCAGGT 

LT AGTCTTATGCCCAGAGGT • . • AATGAGT ACTTCG ATAGAGGAACTCAAATGAATATTAAT 

SLMPRGQNEYFDRGTQMNIN 39 



GLMPRGQSEYFDRGTQ MNIN 40 
CT GGTCTTATGCCAAGAGGACAGAGTGAGTACTTTGACCGAGGTACTCAAATGAATATCAAC 

XT CTTTATGATCACGCGAGAGGAACACAAACCGGCTTTGTCAGATATGATGACGGATATGTT 

LYDHARGTQTGFVRYDDGYV 59 



LYDHARGTQTGFVRHDDG YV 60 
CT CTTTATGATCATGCAAGAGGAACTCAGACGGGATTTGTTAGGCACGATGATGGATATGTT 

LT TCCACTTCTCTTAGTTTGAGAAGTGCTCACTTAGCAGGACAGTATATATTATCAGGATAT 

ST SLSLRSAHLAGQYILSGY 79 



S TSISLRSAHLVGQTILSGH 80 
CT TCCACCTCAATTAGTTTGAGAAGTGCCCACTTAGTGGGTCAAACTATATTGTCTGGTCAT 

XT TCACTTACTATATATATCGTTATAGCA AATATGTTTAATGTTAATGATGTA 

SLTIYIVIA NMFNVKDV 96 



STYYI YVIATAPNMFNVNDV 100 
CT TCTACTTATTATATATATGTTATAGCCACTGCACCCAACATGTTTAACGTTAATGATGTA 

LT ATTAGCGTATACAGCCCTCACCCATATGAACAGGAGGTTTCTGCGTTAGGTGGAATACCA 

ISVYSPHPYEQEVSALGGIP 116 



LGAYSPHPDEQEVSALGGIP 120 
CT TTAGGGGCATACAGTCCTCATCCAGATGAACAAGAAGTTTCTGCTTTAGGTGGGATTCCA 



Figure 2a 
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LT TATTCTCAGATATATGGATGGTATCGTGTTAATTTTGGTGTGATTGATGAACGATTACAT 

YSQI YGWYRVNFGVI DERLH 136 



YSQIYGWYRVHFGVLDEQLH 140 
CT TACTCCCAAATATATGGATGGTATCGAGTTCATTTTGGGGTGCTTGATGAACAATTACAT 

LT CGTAACAGGGAATATAGAGACCGGTATTACAGAAATCTGAATATAGCTCCGGCAGAGGAT 

RNREYRDRYYRNLNIAPAED 156 



R NRGYRDRYYSNLD X A P A A D 160 
CT CGTAATAGGGGCTACAGAGATAGATATTACAGTAACTTAGATATTGCTCCAGCAGCAGAT 

LT GGTTACAGATTAGCAGGTTTCCCACCGGATCACCAAGCTTGGAGAGAAGAACCCTGGATT 

GYRLAGFPPDHQAWRE EPWI 176 



GYG LAGFPPEHRAWREEPWI 180 
CT GGTTATGGATTGGCAGGTTTCCCTCCGGAGCATAGAGCTTGGAGGGAAGAGCCGTGGATT 

LT CATCATGCACCACAAGGTTGTGGAGATTCATCAAGAACAATCACAGGTGATACTTGTAAT 

HHAPQGCGDSSRTITGDTCN 196 



HHAPPGCGNAPRSSI SNTCD 200 
CT CATCATGCACCGCCGGGTTGTGGGAATGCTCCAAGATCATCGATCAGTAATACTTGCGAT 

LT GAGGAGACCCAGAATCTGAGCACAATATATCTCAGGGAATATCAATCAAAAGTTAAGAGG 

EETQNLSTIYLREYQSKVKR 216 



EKTQSLGVKFLDEYQSKVKR 220 
CT GAAAAAACCC AAAGT CTAGGTGTAAAATT CCTTG AC G AATA CC AATCT AAAGTTAAAAG A 

LT CAGATATTTTCAGACTATCAGTCAGAGGTTGACATATATAACAGAATTCGGGATGAATTATGA 
Q IFSDYQS EVDIYNRIRDEL * 



QIF SGYQSDIDTHNRIKDEL * 
CT CAAATATTTTCAGGCTATCAATCTGATATTGATACACATAATAGAATTAAGGATGAATTATGA 



Figure 2b 
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